Machine Learning Engineer - Vision-Language Models
Are you ready to take cutting-edge AI research from the lab to the living room? Our customer is revolutionizing home automation with a multi-purpose household robot designed to handle real, everyday tasks. We're looking for a talented ML/Research Engineer who’s passionate about turning research into robust, scalable products that make life easier.
In this role, you will leverage your expertise in vision-language models to enable seamless interactions between the robot and its environment, improving its ability to recognize, understand, and respond to complex, real-world tasks. This isn’t about academic publication - it's about creating high-impact, practical solutions that drive performance and reliability in a home setting.
You will develop, optimize, and deploy vision-language models that enhance robotic capabilities for object recognition, spatial understanding, and task interpretation and collaborate with cross-functional teams (mechanical engineering, product design, software engineering) to align AI functionality with real-world use cases.
Experience Required
- MS or PhD from a top-tier Universit
- Proven background in applied AI, particularly in vision-language models (e.g., CLIP, BLIP, Vision Transformers).
- Strong programming skills in Python, experience with deep learning frameworks (PyTorch, TensorFlow), and familiarity with data pipeline management.
- Experience working at top AI Labs, Tech Orgs or Autonomy Innovators - FAIR Labs, NVIDIA, Covariant, Cruise, Waymo as a few examples.
Interested in learning more? Please reach out ASAP!